Improved features using convolution-augmented transformers for keyword spotting

نویسندگان

چکیده

Transformer can effectively model long rang dependency, but suffer from uncapable to extract local feature patterns. While CNNs exploit features effectively. In this paper, we seek combine convolution and Transformers improves over using them individually, propose improved convolution-augmented transformers for keyword spotting. The are constructed with a ResNet front-end back-end in series. Using spotting task. results show that the convolution- augmented yield at least 3% improvement compared other features.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Transferable Deep Features for Keyword Spotting

Deep features, defined as the activations of hidden layers of a neural network, have given promising results applied to various vision tasks. In this paper, we explore the usefulness and transferability of deep features, applied in the context of the problem of keyword spotting (KWS). We use a state-ofthe-art deep convolutional network to extract deep features. The optimal parameters concerning...

متن کامل

Keyword Spotting Using Durational Entropy

This paper deals with the task of detection of a given keyword in continuous speech. We build upon a previously proposed algorithm where a modified Viterbi search algorithm is used to detect keywords, without requiring any explicit garbage or filler models. In this work, the concept of durational entropy is used to further discard a large fraction of false alarm errors. Durational entropy is de...

متن کامل

Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...

متن کامل

Assisted keyword indexing for lecture videos using unsupervised keyword spotting

Many students use videos to supplement learning outside the classroom. This is particularly important for students with challenged visual capacities, for whom seeing the board during lecture is di cult. For these students, we believe that recording the lectures they attend and providing e↵ective video indexing and search tools will make it easier for them to learn course subject matter at their...

متن کامل

Morphological Segmentation for Keyword Spotting

• We explore the impact of morphological segmentation on Keyword Spotting (KWS). ! • Handling out-of-vocabulary (OOV) words is a major challenge in KWS we aim to alleviate this problem by utilizing sub-word units.! • We augment a state-of-the-art KWS system with subword units derived from supervised and unsupervised morphological segmentations, and compare with phonetic and syllabic segmentatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ITM web of conferences

سال: 2022

ISSN: ['2271-2097', '2431-7578']

DOI: https://doi.org/10.1051/itmconf/20224702039